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Clostridium difficile focussed antihftdli«>{^ 

The present invention is concerned with antibodies either specific for and which confer 
immunity against infection by Clostridium difficile or antibodies produced after challenge 
with a C. difficile based vaccine, methods of identifying same, and uses of same. 

C. difficile is a gram-positive anaerobic bacterium, and is deemed a significant himian 
pathogen causing a spectrum of diseases ranging from mild diarrhoea to ftihninant 
pseudomembranous colitis (PMC) - collectively referred to as C. difficile antibiotic- 
associated diarrhoea (CDAD). CDAD is a common, iatrogenic, nosocomial disease 
associated with substantial morbidity and mortality, especially in the elderly. Two factors 
have been assigned main roles in the pathogenesis of CDAD - the suppression of the 
resident intestinal flora by the administration of antibiotics, and the production by the 
bacterium of two high molecular weight toxins, exotoxin A and exotoxin B. 

The bacterium is endemic in hospitals, and studies have shown that approximately one 
third of patients receiving antibiotic treatment in acute-care medical wards were colonised 
by C. difficile while in hospital (Kyne, L., et al., 2002, Clin. Infect. Dis. 34(3), pp346-53, 
PMID: 11774082). Of these patients, over half went on to develop CDAD while the 
remainder were symptomless carriers. CDAD is a major factor in extension of patient 
hospital stay times, and estimates suggest that the cost of this disease in the US exceeds 
$1.1 biUion per year (Kyne, L., et al.. Supra). Patients suffering from CDAD respond weU 
to a treatment which includes a discontinuation of the inciting antibiotic and treatment with 
either of the antibiotics metronidazole and vancomycin. However, the use of e.g. 
vancomycin is one of last resort since it is associated with several problems. Not only may 
it cause nephrotoxicity, ototoxicity, bone marrow toxicity and the red man syndrome, but 
the problem with this treatment regime is that the CDAD often returns after successfiil 
treatment of the initial episode, and this reoccurrence represents a serious clinical problem. 
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Additionally, there is evidence that C. difficile is becoming resistant to metronidazole and 
partially resistant to vancomycin, demonstrating the need for new alternatives in the 
treatment of CDAD. 

5 Exotoxins A and B which are produced by pathogenic strains of the bacteriimi are 
cytotoxic, enterotoxic and proinflammatory, and are considered to be the main viralence 
factors of this non-invasive microorganism. However, not all infections with toxigenic 
strains result in disease, prompting the search for additional vimlence factors. Bacterial 
surface expressed antigens represent candidate virulence factors, and are also considered 

10 important since such proteins likely mediate the essential functions such as adhesion to the 
epithelial layer of the gut in the first step of colonization or interaction with mediators of 
local immxmity. In common with many other bacteria, C difficile expresses a crystalline 
or paracrystalline surface layer (S-layer) on the outer cell surface. Such S-layers comprise 
proteins or glycoproteins forming a regularly arranged lattice on the extemal surface of the 

IS bacterium, and have previously been shown to be essential for the virulence of pathogens 
such as Aeromanas salmonicida and Campylobacter fetus. In contrast to most bacteria 
which comprise one S-layer, C difficile is known to comprise two superimposed 
paracrystalline S-layers, each composed of a glycoprotein subunit which varies slightly in 
apparent molecular weight among different C. difficile strains. Most strains of C difficile 

20 express two major S-layer proteins (SLPs), one of 32-38 kDa (low-MW SLP) and a second 
of 42-48 kDa (high-MW SLP). The low-MW SLP appears to be immunodominant and is 
the antigen most commonly recognised by patients suffering from CDAD, and is the only 
antigen recognised in EDTA extracts of bacteria by antisera raised in rabbits against whole 
C. diffiicile cells (Calabi, E. et al., 2001, Mol, Microbiol., 40(5) pi 187-99, PMBD: 

25 11401722). 

During the course of microbial infection various adaptive strategies are employed by the 
inuxmne system. One such strategy, and arguably the most important, is the production of 
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an antibody response. Antibodies capable of binding antigens displayed by the infectious 
agent are produced and bind to and allow killing of the microorganism trough 
complement activation, recruitment of macrophage and through direct interaction with the 
microbe itself. The therapeutic efficacy of antibodies capable of binding a given antigen 
5 varies and this is reflected in the fact that antibody production by the immune system 
matures during the course of an infection and becomes focussed in the case of a patient 
successfully fighting off an infection. 

The antibody response is elicited by the B cell repertoire where individual B cells each 
10 produce structurally diverse antibody molecules. The actual size of this B cell/antibody 
repertoire is unknown, but it is estimated that the random clonal firequency of reactivity for 
a given antigen may be as high as 1 in 100,000 in cultured B cells (Nobrega, A., et al., Eur 
J Immunol. 1998 Apr;28(4): 1204-15; PMID: 9565360). During the course of infection, 
antibodies capable of binding the pathogen are selected for by changes in the B cell 
IS population resulting in key antibodies being produced in large numbers. The mechanisms 
for these changes include clonal expansion, isotype switching, and somatic mutation of 
immunoglobulin variable regions. B cells responsible for generating antibodies which are 
able to bind a pathogen multiply, thus skewing the B cell repertoire and changing the 
proportions of B cells, 

20 

Non-antibiotic based therapeutic regimes for the treatment/prevention of C. difficile 
infection are based upon vaccination and passive immunization. Vaccination treatment 
comprises administering to a patient either a nucleic acid sequence encoding an 
immunogenic fragment of the C difficile surface layer protein or a variant or homologue 
25 thereof, or an equivalent polypeptide fragment (as disclosed in WO 02/062379). Passive 
itmnxmotiierapy is typically achieved by administering to a patient a monoclonal antibody 
specific to an immunogen produced by a pathogen. In general, passive immunotiierapy is 
particularly effective in treating immunocompromised patients who are unable to respond 
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to vaccination, and to patients who need immediate therapy and cannot wait for 
vaccination to take effect. In the case of a C. difficile infection, passive hmnunization 
relies on the administration to a patient of toxin-neutralizing polyclonal immune globulin, 
(as disclosed in WO 99/2030.4), or antibodies raised against the whole bacterium and the 
toxins (as disclosed in WO 96/07430). 

Thus, as can be seen from the prior art, in order to effect treatment of the patient, an 
immunogen is first isolated and purified, administered to test animals, and cells pnxlucing 
antibody specific against the immunogen cloned. The range of antibodies produced by the 
clones can then be tested for their therapeutic efficacy and a single monoclonal antibody 
selected which is then administered to patients to effect passive immunotherapy. 

There are, however, several problems associated with current passive imraunotiierapy 
regimes aimed at treating C. difficile infections. For example, passive immunotherapy 
requires that there are survivors of the C difficile infection and patients who have been 
vaccinated. Each batch of antibody can be different leading to difficulties associated with 
standardisation and administration of the immunolherapeutic reagent. In addition, the 
problem of inadvertent administration to a patient of adventitious agents (e.g HIV, HBV, 
HCV, or as yet unidentified agents) is a real one, and up-to-date screening of any 
immunotherapeutic reagent is required. Finally, the strain variabiUty exhibited by C 
difficile means that a given antibody may only be useful against certain strains of the 
bacterium and not against others. 

Obviously the techniques involved are somewhat complex, inconvenient, expensive and 
time consimiing. In general they require that an immunogen is isolated from the infecting 
pathogen and used to generate antibody. Simply isolating the immunogen can be extremely 
difficult and time consuming, particularly if it comprises carbohydrate or complex non- 
linear epitopes (i.e. epitopes having secondary, tertiary and/or quatemary structural 
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features) which cannot be synthesised in vitro, and are impossible to isolate and produce 
as an antigen for use as e.g. a vaccine. The SLPs of C. difficile contain a glycoprotein 
subunit which varies in molecular weight between species. It may be the case that C. 
difficile epitopes are only produced in vivo and are not synthesised in vitro, consistent widi 
for example Neisseria gonorrhoeae infections (the causal agent of gonoirhea) where 
specific antigens are only expressed upon infection of a host. Such antigens fell into the 
general class of ciyptoantigens. Furthermore, C. difficile may display hi^y labile antigens 
which are difficult to work with since during use they simply degrade and the epitope they 
display is lost With regard to this range of epitopes/antigens whose identification and/or 
in vitro use is of great difficulty or impossible, the present invention overcomes these 
disadvantages and provides a solution by providing antibodies whose CDR regions have 
been generated in response to C. difficile epitopes during antibody responses of patients 
infected with C. difficile. 

In addition, the prior art typically has to attempt to achieve an equivalent of affinity 
maturation of antibodies by first synthesising a set of candidate antibodies specific to an 
antigen, testing them for their bindmg characteristics and then modifying the sequences 
of the candidates in order to optimise the binding. A thorough attempt at affinity 
maturation (i.e. optimising antibody binding) can require the synthesis of thousands of 
different antibodies, which can be costly and time-consuming. Because the antibodies of 
tiie present invention are obtained firom patients who have either been infected by a 
pathogen displaying the antigen or who have been vaccinated with an antigen, they have 
by their very nature and definition aheady undergone a£5nity maturation, as is most clearly 
demonstrated by the sequences of their CDR3 regions of the variable heavy and variable 
light chains (i.e. tiie CDR-H3 and CDR-L3 regions). 
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According to the present invention there is provided an antibody or an antigen binding 
firagment thereof having the CDR-H3 sequence selected from the group consisting of: SEQ 
ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, and SEQ ED NO: 31. 

5 Also provided according to the present invention is an antibody or an antigen binding 
fragment thereof having the CDR-L3 sequence selected from the group consisting of: SEQ 
ID NO: 32, SEQ ID NO: 33, and SEQ ID NO: 34. 

Also provided according to the present invention is an antibody or an antigen binding 
10 fragment thereof having a CDR-H3 sequence selected from the group consisting of: SEQ 
ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, and SEQ ID NO: 31, and 
a CDR-L3 sequence selected from the group consisting of: SEQ ID NO: 32, SEQ ID NO: 
33, and SEQ ED NO: 34. 

15 According to the present invention there is also provided a method for identifying 
candidate sequences of at least the CDR3 region of antibodies specific against at least one 
antigen produced by C difficile during an infection or against a vaccine, comprising the 
steps of: 



(i) 



with B cells isolated from at least one patient who has been infected 
by Clostridium difficile or administered said vaccine, sequencing at 
least the CDR3 region of the VH and/or VL coding regions of said B 
cells; and 



20 



(ii) 



correlating said sequenced at least the CDR3 regions of the VH and/or 
VL coding regions of said B cells from said at least one patient to 
identify a set of candidate sequences for at least a CDR3 region of 
antibodies specific against said at least one antigen produced by 
Clostridium difficile or against said vaccine, each of said set of 
candidate CDR3 sequences or a sequence having at least 80% 



25 
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homology therewith occurring in total at a frequency of at least 1 
percent in tiie set of sequences determined at step (i). 

Examples of patients used as a source of B cells in such a method are himians and other 
mammals such as mice, rabbits, rats, baboons, monkeys and apes. 

In certain embodiments of the invention, step (i) may comprise the steps of: 

(i)(a) isolating B cells from at least one patient who has been infected by 

Clostridium difficile or administered said vaccine; and 
(i)(b) sequencing at least the CDR3 region of the VH and/or VL coding 

regions of said B cells; 

TTie sequences occurring in total at a frequency of at least 1 percent in the set of sequences 
determined at step (i) may be the candidate CDR3 sequences or a sequence having at least 
80% homology, for example 85, 90, 95, 96, 97, 98 or 99% homology therewith. Sequence 
homology is as determined using the BLAST2 program (Tatusova TA et al, FEMS 
Microbiol Lett 1999 May 15;174(2):247-50; PMID: 10339815) at the National Center for 
Biotechnology Information, USA (www.ncbi.nlm.nih.gov) with defriult parameters. 

For example, the sequences occurring in total at a frequency of at least 1 percent in the set 
of sequences determined at step (i) may be the candidate CDR3 sequences or a sequence 
having 1 or 2 amino acid changes tiierefiom. In general, tiie frequency of a dominant 
sequence is added together with the frequency of any sequence showing >80% homology, 
such that any sequence exhibiting a frequency of 1 % or greater is then deemed a candidate 
sequence. For example, a first sequence may occur at a frequency of 0.7 percent, and first, 
second, third and fourth sequences each having a single amino acid change there fix)m each 
occur at a frequency of 0.1% - the total occurrence is therefore 1.1% and the dominant 



wo 2004/094474 



PCT/GB2004/001619 



-8- 

antibody sequence (occurring at a frequency of 0.7%) is therefore a candidate CDR3 
sequence. 

This is fundamentally different to prior art methods of making a library of antibodies in a 
vector such as a phage library where antibodies are isolated by panning and which require 
binding with a specific (known) antigen in a structurally- and charge-specific state. An 
analogy by which the two can be compared would be to say that the prior art making of a 
library and subsequent panning is like going into a library, picking up a book and hoping 
that it is the right one. In comparison, the present invention is like going into a library, 
reading all of the books in the library and finding that one book is dominant - that tiiere are 
many copies of it - and that it is relevant to the disease to be treated. 

In addition, the prior art use of library systems has a number of significant problems - in 
particular, some antibodies produced by a library may cause the death of the organism 
expressing them and therefore they simply cannot be detected. This is not a particular 
problem when e.g. looking for antibodies specific to cancers, but when one is searching 
for antibodies specific to an antigen from a pathogen which might be homologous to one 
produced by the host expression system (e.g. Escherichia coli) then important antibodies 
cannot be expressed. The use of e.g. E. coli to express libraries of e.g. human antibodies 
also suffers from the problem of codon usage - codons used by humans for specific amino 
acids can firequently not be the optimum ones for the same amino acid in E. coli or other 
host systems. This means that an important antibody might not be e7q)ressed (or at least not 
in sufficient quantities) since tiie codons in its sequence are highly inefficient in E. coli, 
resulting in the E. coli being unable to read through and express it in full. Codon 
optimisation of antibody libraries is obviously not an option since the libraries would first 
have to be sequenced, which defeats the main advantages of using libraries. Since the 
present invention sequences antibodies directly from a patient, it avoids this problem. 
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The correlation step (ii) may also correlate the occurrence of the candidate sequences 
against their occurrence in patients who have not been infected with Clostridium difficile 
or administered the vaccine, sequences only being determined to be candidate sequences 
if they or a sequence having 80% homology to them occur in total at a frequency of less 
than 1 percent in the non-infected/vaccinated patients. 

The at least one antigen produced by the micro-organism may of com^e be an immunogen. 

The present invention provides imique opportunities for understanding antibody responses 
to infection which were not previously available, and provides novel diagnostic and 
therapeutic opportunities. 

In particular, since the methods of the present invention bypass flie need to isolate an 
antigen from C. difficile, the difficulties associated with identifying e.g. complex non- 
linear epitopes, epitopes containing carbohydrate, or cryptoantigens are avoided. Instead 
tlie methods facilitate tiie determination of the identity of antibodies specific against one 
or more antigens produced by the bacterium. As is explained below, the methods of the 
present invention allow the identification of therapeutically effective antibodies and can 
allow the identification of the most important parts of antibody sequences. This is 
particularly evident when the B cells of a given patient are sampled at different time points 
during the course of an infection by C. difficile - as the patient's immune system becomes 
focussed on producing therapeutically effective antibodies against the bacterium, a 
focussing of antibody sequences is observed, and the development of variable and non- 
variable regions within the CDR parts of the VH and/or VL sequences is observed. In 
addition, the methods of the present invention can allow the identification of antibodies 
best suited to the treatment of C difficile infection in particular groups of patients, such 
as age, sex and racial groups. Similarly by conq)aring the antibody sequences of patients 
who have recovered from infection with C difficile with those that have not recovered 
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from infection, ineffective antibody sequences (which might be produced by both sets of 
patients) can be identified. 

The B cells isolated from the at least one patient can be peripheral B-cell lymphocytes 
S (PBLs), and can be isolated firom a blood sample fi-om the at least one patient B-cells can 
also be isolated firom other sources and the present invention extends to their use. For 
example, B-cells can be isolated firom spleen. 

Antibody CDR sequences are determined as detailed in the "Experiments" section using 
10 standard techniques and definitions of their start and end. 

As is detailed below, the antibody sequences are determined Scorn patients with C difficile 
infections. These sequences can be derived firom B cell immunoglobulin mRNA and hence 
reflect expressed antibodies and repertoires therein. The sequences determined according 
15 to the present invention need not be the sequences of whole antibody molecules - they 
comprise at least the VH and/or VL sequences which specify the regions responsible for 
antigen binding. 

From the nucleotide sequences determined by the initial sequencing, putative amino acid 
20 sequences for the VH and/or VL regions can be determined using standard algorithms and 
software packages (e.g. see www.mrc-lmb.cam.ac.uk/pubseq/, the Staden package and 
Gap4 programs; Rodger Staden, David P. Judge and James K. Bonfield. Managing 
Sequencing Projects in the GAP4 Environment. Introduction to Bioinformatics. A 
Theoretical and Practical Approach. Eds. Stephen A. Krawetz and David D. Womble. 
25 Human Press Inc., Totawa, NJ 075 12 (2003); Rodger Staden, David P. Judge and James 
K. Bonfield. Analysing Sequences Using the Staden Package and EMBOSS. Introduction 
to Bioinformatics. A Theoretical and Practical Approach. Eds. Stephen A. Krawetz and 
David D. Womble. Human Press Inc., Totawa, NJ 07512 (2003)). These can be further 
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characterised to determine the CDR (Complementarity Determining Region) parts of the 
VH and/or VL sequences, particularly CDRl, CDR2 and CDR3. Methods for determining 
the putative amino acid sequences and identifying CDR regions are well known and 
detailed below. 

As well as the VH and/or VL amino acid sequences, several pieces of additional 
information can be used in the correlation step: 

(i) the strain of C difficile causing the patienfs infection; 

(ii) the time point during tiie infection process at which antibody 
sequences were sampled 

(iii) patient details - none or more of: sex, race, and age; and 

(iv) the range of complementarity determining regions (CDR) ie the 
variable regions of the antibody that imdergo direct antigen 
binding/contact 

Thus in a method according to the present invention, B cells can be used which have been 
isolated from the at least one patient at a plurality of time points during infection of the at 
least one patient by C. difficile (or post-vaccination), correlation step (ii) correlating the 
time point during infection of the at least one patient by C. difficile at which the B cells are 
isolated. 

Similarly, in a method according to the present invention, B cells can be used which have 
been isolated from at least two patients, at least one of whom has recovered from infection 
by C. difficile^ and at least one of whom has not recovered from infection by C. difficile^ 
correlation step (ii) correlating the recovery of the at least two patients from infection by 
C. difficile. 
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Similarly, in a method according to the present invention, B cells can be used which have 
been isolated from at least two patients, the patients being infected by strains of C difficile 
producing the at least one antigen, correlation step (ii) correlating the sequenced at least 
the VH and/or VL coding regions of the B cells to identify a set of candidate sequences for 
antibodies, each of which is specific against at least one shared antigen produced by the 
different strains of C difficile or is specific against different antigens produced by the 
different strains of C. difficile. 

The sequencing of the VH and/or VL regions can also be used to identify the surrounding 
antibody framework, and this can be used to determine the antibody isotype. This can then 
be used in the correlation step to determme whether specific antibody isotypes are 
particularly usefril. 

By the term "therapy" is meant any treatment which is designed to cure, alleviate, remove 
or lessen the symptoms of, or prevent or reduce the possibility of contracting any disorder 
or malfrmction of the human or animal body 

The invention is suitable for identifying antibody sequence usefiil in treating C difficile 
infection. 

Using the abovementioned additional information fields in the correlation step (ii), it is 
easy to perform the correlation usmg e.g. only CDR sequences, CDRs from either a given 
patient or a given infection or both, CDRs from a given site or patient group, or CDRs 
from different sampling times over the course of infection. 



These correlations are CDR-specific because these are the hypervariable regions 
responsible for antigen binding and sequence diversity of the antibody repertoire (alfliough 



wo 2004/094474 



PCT/GB2004/001619 



-13- 

the framework regions can be used in the correlation stqp as well to reveal tiie antibody 
isotype). 

The present invention is useful in a wide range of applications, e.g. antibody therapy, and 
vaccine studies. 

Antibody Therapy: 

This involves the passive transfer of antibody to non-immune individuals (e.g. patients 
undergoing chemo-/radio-therapy, immunosuppression for organ transplantation, 
immimocompromised due to underlying conditions such as diabetes, ttaimia etc, also the 
very young or very old). 

The present invention can be used to determine the sequences of antibodies conferring 
immunity by looking for over-represented VH and/or VL sequences in patients who have 
overcome infection. These protective antibodies can be re-synthesised at the genetic level, 
over-expressed in E. coli (or other e3q)ression systems) and purified. The resultant purified 
recombinant antibody can then be administered to patients as a passive immunotherapy. 
Antibodies can also be ordered fcom commercial suppliers such as Operon Technologies 
Inc., USA (www.operon.com) by simply supplying them with the sequence of the antibody 
to be manufactured. 

Therapeutically useful sequences can be identified in a number of wa>^ which can be used 
in the correlation step (ii): 

1 ) By looking for antibody sequence over-rqpresentation in patients who 
have recovered from C. difficile infection - B cells that produce 
antibodies capable of pathogen binding undergo clonal expansion, 
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hence high frequency antibody sequences are most likely to bind 
pathogen and confer immunity. 

By following alterations in the antibody repertoire over the course of 
C difficile infection. During infection, antibodies undergo a 
maturation process to improve pathogen binding and this is 
characterised by sequence alterations. Also, B cell clonal expansion 
is more prevalent in the final stages of infection where the infection 
is cleared. The most frequent antibodies in the repertoire are chosen 
as candidates for immunotherapy. Following the maturation process 
the candidate antibody will demonstrate which key amino acid residue 
alterations improve antigen binding and this information can be used 
to improve antibody design. 

Analysis of the repertoires from different patients with C. difficile 
infection can identify shared and identical protective antibodies. 
These antibodies are attractive choices for immunotherapy as their 
occurrence in different patients suggests a strong positive selection in 
their favoui^ hence an important role in antibody-based immunity. 

Analysis of the repertoires from patients infected with different strains 
of C. difficile. In this situation, if a common antibody sequence is 
found in the repertoires for both strains, the antibody may be useful 
in treating both strains of tiie bacterium, ie displaying a broad 
spectrum. 

Analysis of the repertoires for affinity maturation of sequences. 
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Vaccination Studies: 

Vaccination protects against infection by priming tlie immune system with 
pathogen-derived antigen(s). Vaccination is effected by a single or repeated exposures to 
the patiiogen-derived antigen(s) and allows antibody matiiration and B cell clonal 
expansion witiiout the deleterious effects of the full-blown infectious process. T cell 
involvement is also of great importance in effecting vaccination of patients. The present 
invention can be used to monitor the immunisation process with experimental C. difficile 
vaccines. Subjects are given the experimental vaccine and VH and/or VL sequences are 
amplified fi:om the patient and the antibody repertoire analysed as described above. 
Qualitative and quantitative assessment of the vaccination process is possible: 

1) The VWVL repertoire of a group of patients who have been 
administered an experimental C. difficile vaccine is assessed. A 
precise molecular dissection of the resulting antibody response to the 
vaccine can be performed to determine (a) clonal expansion of 
protective antibodies, (b) protective antibody production in different 
populations (for example differing ethnic groups with different 
genetic backgrounds, as well as e.g. age and sex groups), and (c) the 
long term effect of the vaccine i.e. the antibody response over the long 
term - long term antibody memory in the immune system as well as 
any autoimmune defects; 

2) Where vaccination results in an increased ^frequency of a given 
antibody, the antibody sequence can easily be cloned and expressed 
and used in animal models of infection. If the antibody is protective, 
it may be useful in itself as an inmiunotherapy, but this also 
demonstrates that the vaccine is likely to be protective without 
subjecting the human subject to experimental infection. 
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Also provided to the present invention is a method of manufacture of a medicament for the 
treatment of a C. difficile infection which produces at least one antigen, comprising the 
steps of: 

(i) performing a method according to the present invention to identify a 
set of candidate sequences for antibodies specific against ttie at least 
one antigen produced by C difficile; and 

(ii) synthesising at least one antibody comprising a said candidate 
sequence specific against the at least one antigen produced by C. 
difficile. 

Medicaments and methods of treatment according to the present invention will be readily 
apparent to one skilled in the art. Medicaments may be prepared using pharmaceutically 
acceptable carriers, diluents or excipients (Remington's: The Science and Practice of 
Pharmacy (1995)Mack Publishing Company, Easton, PA, USA). The medicaments and 
methods of treatment may be effected using a pharmaceutically effective amount of the 
antibody/antigen-binding firagment. Appropriate dosages will be readily apparent to one 
skilled in the art and may be readily determined, for example by means of dose-response 
experiments 

Also provided according to the present invention is a method of treatment of an infection 
of a patient by C, difficile which produces at least one antigen, conq)rising the steps of: 

(i) performing a method according to the present invention to identify a 
set of candidate sequences for antibodies specific against the at least 
one antigen produced by C difficile; 

(ii) synthesising at least one antibody comprising a said candidate 
sequence specific against the at least one antigen produced by C. 
difficile; and 
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(iii) administering a therapeutically effective quantity of said at least one 
S3mthesised antibody to said patient 

Also provided according to the present invention is a method of producing a database 
which identifies candidate sequences for antibodies specific against at least one antigen 
produced by C. difficile^ comprising the steps of: 

(i) performing a method according to the present invention to identify a 
set of candidate sequences for antibodies specific against the at least 
one antigen produced by C difficile; and 

(ii) storing the data produced by said method in said database. 

Also provided is a method of generating a report which identifies candidate sequences for 
antibodies specific against at least one antigen produced by C difficile^ comprising Ihe 
steps of: 

(i) performing a method according to the present invention to identify a 
set of candidate sequences for antibodies specific against the at least 
one antigen produced by C difficile; and 

(ii) producing a report comprising the data produced by said method of 
step (i). 

Also provided according to the present invention is a method for determining the efficacy 
of a vaccine, comprising the steps of: 

(i) with B cells isolated from at least one patient who has been 
administered said vaccine, sequencing at least the CDR3 region of the 
VH and/or VL coding regions of said B cells; and 

(ii) correlating said sequenced at least the CDR3 region of the VH and/or 
VL coding regions of said B cells to identify a set of candidate 
sequences for at least the CDR3 region of antibodies specific against 
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said vaccine, each of said set of CDR3 candidate sequences or a 
sequence having at least 80% homology therewith occurring in total 
at a frequency of at least 1 percent in the set of sequences determined 
at step (i). 

In certain embodiments of the invention, step (i) above may comprise the steps of: 
(i)(a) administering said vaccine to at least one patient; 
(i)(b) isolating B cells from said at least one patient; and 
(i)(c) sequencing at least the CDR3 region of the VH and/or VL coding 
regions of said B cells. 

Correlation step (ii) may comprise correlating said sequenced at least a CDR3 region of 
the VH and/or VL coding regions of said B cells with sequenced at least a CDR3 region 
of the VH and/or VL coding regions of B cells isolated from at least one patient who has 
been infected with C. difficile against which vaccination with said vaccine is intended to 
stimulate a protective immune response. 

As described above, additional information may be used in the correlation of step (ii), 
including: 

(i) the time since administration of the vaccine to the patient; 

(ii) patient details - none or more of: sex, race, and age; and 

(iii) the range of complementarily determining regions (CDR) ie the 
variable regions of the antibody that undergo direct antigen 
binding/contact 

The antibody sequences determined from patients who have been admmistered the vaccine 
can be compared with antibody sequences isolated from patients who have been infected 
with C difficile against which the vaccine is intended to stimulate a protective immune 
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response in patients. Thus it is possible to determine whether the vaccine resiilts in the 
generation of antibodies which are also produced in response to infection by C. difficile 
itself. In particular, the comparison can be made using antibody sequences determined 
from patients who have recovered from infection by C. difficile. 

In particular the method may determine the efficacy of the vaccine in stunulating a 
protective immune response against C. difficile against which vaccination with the vaccine 
is intended to stimulate a protective immune response. 

Also provided is a method of producing a database which identifies the efficacy of a 
vaccine, comprising the steps of: 

(i) performing a metiiod according to the present invention to determine 
the efficacy of said vaccine; and 

(ii) storing the data produced by said method in said database. 

Also provided according to the present invention is a method of generatmg a report which 
identifies the efficacy of a vaccine, comprising the steps of: 

(i) performing a method according to the present invention to determine 
the efficacy of said vaccine; and 

(ii) producing a report comprising the data produced by said method. 

Also provided according to the present invention is a diagnostic test method for identifying 
a Clostridium difficile infection in a patient, comprising the steps of: 

(i) with B cells isolated from said patient, sequencing at least the CDR3 
region of the VH and/or VL coding regions of said B cells; 

(ii) comparing said sequenced at least said CDR3 region of the VH and/or 
VL coding regions of said B cells with a set of sequences for at least 
the CDR3 region of antibodies specific against Clostridium difficile. 
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and detemuning whether each of said set of CDR3 sequences or a 
sequence having at least 80% homology therewith occurs in total at 
a frequency of at least 1 percent in the set of sequences determined at 
step (i); and 

(iii) correlating the results of comparison step (ii) to determine the 
presraice or absence of a Clostridium difficile infection in said patient 

Also provided according to the present invention is a diagnostic test metiiod for 
determining the susceptibility of a patient to Clostridium difficile infection, comprising the 
steps of: 

(i) with B cells isolated from said patient, sequencing at least the CDR3 
region of the VH and/or VL coding regions of said B cells; 

(ii) comparing said sequenced at least said CDR3 region of the VH and/or 
VL coding regions of said B cells with a set of sequences for at least 
the CDR3 region of antibodies specific against Clostridium difficile^ 
and determining whether each of said set of CDR3 sequences or a 
sequence having at least 80% homology therewith occurs in total at 
a frequency of at least 1 percent in the set of sequences determined at 
step (i); and 

(iii) correlating the results of comparison step (ii) to determine the 
susceptibility of said patient to Clostridium difficile infection. 

The diagnostic test method could be used to sequence the CDR3 sequences of patients in 
for example a hospital ward, to determine which patients were susceptible to infection by 
the bacterium. The presence of C difficile specific antibodies in a sample (as detennined 
by sequencing of CDR3 regions) would likely indicate that such patients were capable of 
mounting a protective immune response toward the bacterium and would not require 
antibiotic treatment In contrast, those patients identified who had not produced C. difficile 
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specific antibodies could be identified as being immunocompromised and therefore 
candidates for antibiotic treatment during the course of a C difficile infection. 

Also provided according to the present invention is a diagnostic test kit for perfomiing a 
diagnostic test method according to the present invention. The kit may contain reagents 
(e.g. PGR primers) which would enable a person skilled in the art to identify CDR3 
sequences from a patient. 



As regards antibodies usable in the present invention, particularly synthetic antibodies (the 
term "antibody" is also considered to incorporate antigen binding fragments of antibodies 
unless the context otherwise requires), the antibody may be a whole antibody or an antigen 
binding fragment thereof and may in general belong to any immunoglobulin class. Thus, 
for exan^le, it may be an IgJ^, IgG or an IgA antibody. The antibody or fragment may be 
of animal, for example, mammalian origin and may be for example of murine, rat, sheep 
or human origin. It may be a natural antibody or a fiagment thereof or, if desired, a 
recombinant antibody fragment, i.e. an antibody or antibody fragment which has been 
produced using recombinant DNA techniques. 

Particular recombinant antibodies or antibody fragments include, (1) those having an 
antigen binding site at least part of which is derived from a different antibody, for example 
those in which the hypervariable or complementarity deteimining regions of one antibody 
have been grafted into die variable framework regions of a second, different antibody (as 
described in, for example, BP 239400); (2) recombinant antibodies or fragments wherein 
non-Fv sequences have been substitiited by non-Fv sequences from otiier, different 
antibodies (as described in, for example, EP 171496, EP 173494 and EP 194276); or (3) 
recombinant antibodies or fragments possessing substantially tiie stoiictiire of a natiiral 
immunoglobulin but wherein tiie hinge region has a different number of cysteine residues 
from that found in the natural immunoglobulin but wherein one or more cysteine residues 
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in a surface pocket of the recombinant antibody or fragment is in the place of another 
amino acid residue present in the natiural immunoglobulin (as described in, for example, 
WO 89/01782 and WO 89/01974). 

5 Teachings of texts such as Harlow, E. and Lane, D, ("Using Antibodies: A Laboratory 
Manual", Cold Spring Harbor Laboratory Press, New York, 1998) further details 
antibodies, antibody fragments, their preparation and use. 

The antibody or antibody fragment may be of polyclonal or monoclonal origin. It may be 
10 specific for at least one epitope. 

Antigen binding antibody fragments include, for example, fragments derived by proteol3^c 
cleavage of a whole antibody, such as F(ab')2, Fab' or Fab fragments, or fragments 
obtained by recombinant DNA techniques, for example Fv fragments (as described, for 
15 example, in WO 89/02465). 

Where it is desired to produce recombinant antibodies according to the invention these may 
be produced using, for example, the methods described in EP 171469, EP 173494, EP 
194276 and EP 239400. Antibody fragments may be produced using conventional 
20 techniques, for example, by enzymatic digestion with pepsin or papain. 

Antibodies according to the mvention may be labelled with a detectable label or may be 
conjugated with an effector molecule, for example a drug eg. an antibacterial agent or a 
toxin or an enzyme, using conventional procedures and the invention extends to such 
25 labelled antibodies or antibody conjugates. 

The contents of each of the references discussed herein, including the references cited 
therein, are herein incorporated by reference in their entirety. 
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Where "PMID:" reference niunbers are given for publications, these are the PubMed 
identification numbers allocated to them by the US National Library of Medicine, from 
which full bibliographic information and abstract for each publication is available at 
www.ncbi.nlm.nih.gov. 

The invention will be furthCT apparent from the following description, with reference to 
the several figures of tiie accompanying drawings, which show, by way of exanq)le only, 
forms of identifying candidate sequences for antibodies specific against at least one 
antigen and of determining the efficacy of a vaccine. 

Of the Figures: 

Figure 1 shows the general principles for the isolation and DNA 
sequencing of VH and/or VL antibody gene fragments from B 
cells. Reference numeral 10 indicates primary PGR, reference 
niuneral 20 the cloning into a DNA sequencing vector using 
the T-tailing principle - random orientation, and reference 
number 30 tiie determining of the nucleotide sequence of 
forward and reverse strands using M13 forward and reverse 
primers; and 

Figure 2 shows a schematic depiction of resynthesised recombinant 
antibody gene cassette. VH and VL regions are linked with a 
glycine serine-rich linker. Each variable domain contains three 
Complementarity Determining Regions (CDRs) which 
participate in antigen binding. 
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EXPERIMENTg 

The experiments below detail methods for identifying antibody VH and/or VL CDR 
sequences that confer immunity in patients with C. difficile infection. These sequences can 
5 then be used to produce synthetic antibodies and these synthetic antibodies are suitable for 
administration to patients for therapy. Also detailed are the VH and VL CDR3 sequences 
identified by the methods. 

Also detailed are methods of determining the efficacy of a vaccine in generating a desired 
10 immune response in a patient. 

Unless stated otherwise, all procedures were performed using standard protocols and 
following manufacturer's instructions where applicable. Standard protocols for various 
techniques including PCR, molecular cloning, manipulation and sequenciag, the 

15 manufacture of antibodies, epitope mapping and mimotope design, cell culturing and 
phage display, are described in texts such as McPherson, M.J. et al (1991, PCR: A 
practical approach, Oxford University Press, Oxford), Sambrook, J. et al. (1989, Molecular 
cloning: a laboratory manual. Cold Spring Harbour Laboratory, New York), Huynh and 
Davies (1985, "DNA Cloning Vol I - A Practical Approach", IRL Press, Oxford, Ed. D.M, 

20 Glover), Sanger, F. et al (1977, PNAS USA 74(12): 5463-5467), Harlow, E. and Lane, 
D. ("Using Antibodies: A Laboratory Manual", Cold Spring Harbor Laboratory Press, New 
York, 1998), Jung, G. and Beck-Sickinger, A.G. (1992, Angew. Chem. Int. Ed. Eng., H: 
367-486), Harris, M.A. and Rae, I.F. ("General Techniques of Cell Culture", 1997, 
Cambridge University Press, ISBN 0521 573645), "Phage Display of Peptides and 

25 Proteins: A Laboratory Manual" (Eds. Kay, B.K., Winter, J., and McCafferty, J., Academic 
Press Inc., 1996, ISBN 0-12-402380-0). 
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Reagents and equipment useful in, amongst others, the methods detailed herein are 
available from the likes of Amersham (www.amersham.co.uk). Boehringer Mannheim 
(www.boehringer-ingeltheim.com), Clontech (www.clontech.com), Genosys 
(www.genosys.com), Millipore (www.milIipore.com), Novagen (www.novagen.com), 
Perkin Ehner (www.perkinehner.com), Pharmacia (www.phannacia.com), Promega 
(www.promega.com), Qiagen (www.qiagen.com), Sigma (www.sigma-aldrich.com) and 
Stratagene (www.stratagene.com). 

The term "antibody" in its various grammatical forms is used herein to refer to 
immunoglobulin molecules and immunologically active portions of immunoglobulin 
molecules, i.e., molecules that contain an antibody combining site or paratope. Such 
molecules are also referred to as "antigen binding fragments" of immunoglobulin 
molecules. 



Illustrative antibody molecules are intact immunoglobulin molecules, substantially intact 
immunoglobulin molecules and those portions of an immunoglobulin molecule that contam 
the paratope, including those portions known in the art as Fab, Fab', F(ab')2, scFv and F(v). 

The term "antibody combining site" refers to that structural portion of an antibody 
molecule comprised of a heavy and light chain variable and hypervariable regions that 
specifically binds (immunoreacts with) antigen. 

The identification and sequencing of B cells produciag antibodies and the analysis of the 
resulting data comprises the following basic steps: 

(1) Isolation of VH and/or VL coding regions from ckculatoiy B cells of 
human patients. 

(2) Determining the Nucleotide sequence of VH and/or VL repertoires. 

(3) Determination of VH and/or VL primary amino acid sequences. 
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(4) Extraction of CDR regions in silico - incorporation into the database 

(5) Detection of dominant CDR & framework regions in VH and/or VL 
repertoire. 

(6) Construction & Production of therapeutic recombinant antibodies 
from dominant antibody sequences. 

1. Isolation of VH and/or VL coding regions from circulatory B cells of human 
patients 

For tiiis method, human patients with C. difficile infection were selected as donors of 
immunised B cells. The criteria for selection were: 

(a) The patient must exhibit a pronounced antibody response to C. difficile, 
detectable for example by Westem blotting. Samples of antibodies were collected from a 
patient blood sample. Samples were diluted and tested for immunoreactivity by Westem 
blotting using antigen(s) derived from C. difficile. In such an assay, patients exhibiting 
strong antibody responses showed pronounced recognition/antibody binding of C. difficile 
antigens as detected using anti-human polyclonal detection antibodies. 

(b) The patient has survived the course of C difficile infection - this improves 
tiie chance of generating B cell responses for antibodies capable of neutralizing the 
bacterium. 

Peripheral B-cell lymphocytes (PBLs) were collected from infected patient blood samples. 
For this heparinised blood was diluted in PBS (20 ml total) and overlaid onto a 15 ml 
cushion of Ficol Hypaque (Pharmacia; unless stated otherwise, all chemicals and culture 
media were purchased from Sigma, UK) in a 30 ml centriftige tube. The PBLs were then 
collected by centiifiigation (400 x ^, 5 minutes) and washed in PBS and harvested by 
centrifugation again. RNA was prepared from PBLs using the QuickPi«p mRNA 
purification kit (Pharmacia) exactiy according to the manufacturers instiuctions. The 
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isolated mRNA was used to prepare cDNA via a reverse transcriptase reaction (Promega 
cDNA synthesis kit). For this 2^g of mRNA was re-suspended in 16 mL nuclease-free 
water and heated to 65 °C for 3 minutes (to denature secondary structure) and then 
immediately chilled on ice for 1 minute. The mRNA was then added to the following 
cocktail: 8 nL 25 mM MgCIj, 4 ^iL dNTP mix (10 mM with respect to each ribonucleotide 
triphosphate), 1 jxL RNAsin 40 u ^iL ' stock solution, 1.2 fiL AMV reverse transcriptase 
(25 u stock solution), 6 ^iL cDNA 10 pmol pi primer (see Figure 1 - cDNA 
synthesis). The mixture was incubated at 42 for 1 hour and then incubated at 100 °C 
for 3 minutes to stop the reaction. 



DNA coding for antigen binding regions was then amplified by PGR using the cDNA 
prepared ftom patients PBLs. For this, flie cDNA was used in the following PGR reaction 
to produce either heavy chain or light chain-derived antibody variable region DNA (see 
Figure 1): 2.5 yL cDNA, 33 ^iL water, 4 dNTP mix (25 mM with respect to each 
deoxynucleotide triphosphate), 5 |nL Taq reaction buffer (Perkin Ehner), 2.5 nL of an 
equimolar primer mix (final concentration of 20 pmol with respect to each primer) and 0.5 
liL Taq DNA polymerase ( 1 u Perkin Ehner Corp). The forward and reverse primers 
used in these reactions are depicted in Figure 1, and their respective nucleotide sequences 
are listed in Table 1. PGR reaction conditions were 94 °G for 1 minute, 57 °G for 1 minute 
and 72 •'G for 2 minutes for a total of 30 cycles, with an extended denaturation (94 °C for 
5 minutes) prior to cycle 1 and an additional extension step after the end of cycle 30 (72 
°C for 10 minutes). PGR was performed using a Perkin Ehner 9700 GeneAmp PGR 
machine. 5 of the PGR reaction was run on a 1% agarose gel to check the amplification 
of the expected 393 base pair (bp) product The remaining product was run on a 0.8% low 
melting point (LMP) agarose gel and the 393 bp band was excised using a clean scalpel 
blade. DNA was extracted fi:om the agarose gel sUce using a GeneGlean H Kit (Anachem, 
Luton, UK). 
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The PGR resulted in two fragment types -derived from VH and VL regions respectively, 
depending on the PGR primer sets used (for details of primers sets for antibody genes and 
a PGR schematic see Table 1 and Figure 1, respectively). 

Table 1 : Primers used in generating and sequencing VH and VL gene fragments. All 
sequences are given in the 5' to 3' direction. Use of primers is described above and in 
Figure 1. 



Table 1 



10 



Primers used m cDNA svnthesis reactions 










SEQ ID NO: 


HuIgGl-4F0R 


GTC 


CAC 


CTT 


GGT 


GTT 


GCT 


GGG 


CTT 


01 


HuCLFOR 


AGA 


CTC 


TCC 


CCT 


GTT 


GAA 


GCT 


CTT 


02 






Primers for primarv PGR 
















SEQ ID NO: 


HUJH1-2FOR 


TGA 


GGA 


GAC 


GGT 


GAC 


CAG 


GGT 


GCC 


03 


HUJH3FOR 


TGA 


AGA 


GAC 


GGT 


GAC 


CAT 


TGT 


CCC 


04 


HUJH4-5F0R 


TGA 


GGA 


GAC 


GGT 


GAC 


CAG 


GGT 


TCC 


05 


HUJH6F0R 


TGA 


GGA 


GAC 


GGT 


GAC 


CGT 


GGT 


CCC 


06 


HuVHlaBACK 


CAG 


GTG 


CAG 


CTG 


GTG 


CAG 


TCT 


GG 


07 


HuVH2bBACK 


CAG 


GTG 


AAC 


TTA 


AGG 


GAG 


TCT 


GG 


08 


HuVH3aBACK 


CAG 


GTG 


CAG 


CTG 


GTG 


GAG 


TCT 


GG 


09 


HuVH4aBACK 


CAG 


GTG 


CAG 


CTG 


CAG 


GAG 


TCG 


GG 


10 


HuVH4bBACK 


CAG 


GTG 


CAG 


CTA 


CAG 


CAG 


TGG 


GG 


11 


HuVHSaBACK 


GAG 


GTG 


CAG 


CTG 


TTG 


CAG 


TCT 


GC 


12 


HuVH6aBACK 


CAG 


GTA 


CAG 


CTG 


CAG 


CAG 


TCA 


GG 


13 


HuJKlFOR 


ACG 


TTT 


GAT 


TTC 


CAC 


CTT 


GGT 


CCC 


14 


HUJK2FOR 


ACG 


TTT 


GAT 


CTC 


CAG 


CTT 


GGT 


CCC 


15 


HUJK3FOR 


ACG 


TTT 


GAT 


ATC 


CAC 


TTT 


GGT 


CCC 


16 


HUJK4FOR 


ACG 


TTT 


GAT 


CTC 


CAC 


CTT 


GGT 


CCC 


17 


HuJKBFOR 


ACG 


TTT 


ATT 


CTC 


CAG 


TCG 


TGT 


CCC 


18 


HuVKlBACK 


GAC 


ATC 


CAG 


ATG 


ACC 


CAG 


TCT 


cc 


19 


HUVK2BACK 


GAT 


GTT 


GTG 


ATG 


ACT 


CAG 


TCT 


cc 


20 


HUVK3BACK 


GAA 


ATT 


GTG 


TTG 


ACG 


CAG 


TCT 


cc 


21 


HUVK4BACK 


GAC 


ATC 


GTG 


ATG 


ACC 


CAG 


TCT 


cc 


22 


HUVK5BACK 


GAA 


ACT 


ACA 


CTC 


ACG 


CAG 


TCT 


cc 


23 


HUVK6BACK 


GAA 


ATT 


GTG 


CTG 


ACT 


CAG 


TCT 


cc 


24 


Primers for DNA Sequencing 
















M13 forward 


GTA 


AAA 


CGA 


CGG 


CCA 


GT 






25 


M13 reverse 


AAC 


AGC 


TAT 


GAC 


CAT 


G 






26 



15 



20 



25 



30 



35 



40 
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VH and/or VL gene fragments were cloned into cloning vector pGEM-T easy (Promega 
Corporation) to &cilitate DNA sequencing. For this 3 fig PGR product was prepared for 
restriction using QIAquick PGR purification spin colimms (Qiagen, UK) according to the 
manufacturers instructions. DNA was eluted from the spin column in 40 \iL buffer EB. 
5 Purified PGR product (2 \iL) was mixed with 1 jiL pGEM-T easy vector, 6 fiL water and 
1 iiL DNA ligase and the mixture ligated for 1 h at room temperature. Ligations were then 
transformed into electrocompetent^. coli TGI cells (Stratagene) by electroporation, and 
plated out onto agar plates containing Ampicillin 100 p.g ml"' IPTG (100 \iM) and X-gal 
(0.006 % w/v). Golonies were allowed to grow overnight at 37 °C and then stored at 4 "G. 
10 Recombinant colonies are identified as white colonies on this media. 

2. Determining the nucleotide sequence of VH and/or VL repertoires 

DNA was first prepared from VH and/or VL recombinant E. coli clones. For this, 
individual colonies were each transferred to 1.2 ml LB broth supplemented with 

15 Ampicillin 1 00 fig ml"* using a 96 well plate format Gultures were the grown at 37 °G for 
24 hours. Bacterial cells were harvested by centrifugation at 4,000 x g, 30 minutes and the 
supematants discarded. Plasmid DNA was prepared using Wizard SV 96 plasmid 
purification kits (Promega Gorporation) essentially following the manufacturer's 
instructions. Yields of plasmid DNA were typically in the order of 5 \ig per 1.2 ml starter 

20 culture. 

DNA sequencing reactions were performed usmg the DYEnamic ET dye terminator cycle 
sequencing kit (Amersham Pharmacia Biotech). Purified plasmid DNA (0.5 ^g) was mixed 
with 8 \iL DYEnamic ET terminator reagent premix and 1 fiL M13 forward or reverse 
25 primer (5 \iM) in a total reaction volume of 20 \iL. Thermal cycling was then performed 
using a GeneAmp PGR system 9700 (Peridn Ehner) with the following parameters: 95 ''G, 
20 seconds; 50 °G, 15 seconds; 60 **G, 1 minute; 30 cycles). Reactions were performed 
using 96 well format non-skirted ELISA plates (AB Gene). Unincorporated dye terminator 
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were removed using precipitation. For this, etiianol samples were mixed with 2 fiL 7.5 M 
ammonium acetate and 55 \iL of 100 % ethanol and centrifuged at 3000 g for 30 minutes. 
DNA pellets were washed with 70 % ethanol and re-suspended in 20 jiL loading solution. 
Reactions were sequenced using a MegaBACE 1000 DNA sequencer (Amersham 
Pharmacia Biotech) following the manufacturers instructions (2 kV injection voltage for 
30 s with electrophoresis at 6 kV for 200 minutes). Chiomatograms were exported using 
the .scf file format for finishing and archiving. 

3. Determination of VH and/or VL primary amino acid sequences 

DNA sequences of VH and/or VL were determined in both forward and reverse strands 
(using M13 forward and reverse primers respectively) and compared in oider to highlight 
discrepancies and maintain a high degree of accuracy. This was done using the Staden 
suite of programs (Staden, R. (1996) The Staden Sequence Anafysis Package. Molecular 
Biotechnology 5, 233-241). First, sequence .scf files were entered into the PREGAP 
program where vector sequence was stripped off and the quality of the sequence was 
assessed. Poor quality sequences where the DNA sequence was ambiguous were rejected 
and re-sequenced. Forward and reverse strands were matched using the GAP program to 
highlight and resolve areas containing sequencing artefacts. The orientation of the VH 
and/or VL sequence was noted and reverse complemented where necessary to produce only 
forward reading firame orientations for translation. VH and VL gene sequences were then 
translated into amino acid sequences. Each sequence was given a unique identifier (name). 

4. Extraction of CDR regions in sUico - incorporation into the database 

A general teaching of identifying CDR regions is at www.bioinf.org.uk/abs/ 

The following set of rules will allow the definition of the CDRs in an antibody sequence. 
There are rare examples where these virtually constant features do not occur (for example 
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the human heavy chain sequence EU does not have Trp-Gly after CDR-H3). The Cys 
residues are the best-conserved feature. 



CDR-Ll 

Start 

Residue before 
Residue after 

Length 



Approx. residue 24 
always a Cys 

always a Tip. Typically Trp-Tyr-Ghi, but also, Tip-Leu-Ghi, 
Trp-Phe-Gln, Trp-Tyr-Leu 
10 to 17 residues 



CDR-L2 
Start 

Residues before 
Length 



always 16 residues after the end of CDR-Ll 
generally Ile-Tyr, but also, Val-Tyr, Ile-Lys, Ile-Phe 
always 7 residues 



CDR-L3 

Start 

Residue before 
Residues after 
Length 



always 33 residues after end of CDR-L2 
always Cys 

always Phe-Gly-Xaa-Gly (SEQ ID NO: 92) 
7 to 1 1 residues 



CDR-Hl 

Start 

Residues before 
Residues after 
Length 



Approx. residue 26 (always 4 after a Cys) 

always Cys-Xaa-Xaa-Xaa (SEQ ID NO: 93) 

always a Tip. Typically Trp-Val, but also, Tip-Ile, Tip-Ala 

10 to 12 residues 



CDR-H2 
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Start 

Residues before 

Residues after 
Length 



always 15 residues after the end of CDR-Hl 

typically Leu-Glu-Trp-fle-Gly (SEQ ID NO: 94), but a number of 

variations 

Lys/Arg-Leu/HeA^al/Phe/Thr/Ala-Thr/Ser/De/Ala 
16 to 19 residues; 



CDR-H3 

Start 

Residues before 
Residues after 
Length 



always 33 residues after end of CDR-H2 (always 2 after a Cys) 
always Cys-Xaa-Xaa (typically Cys-Ala-Arg) 
always Trp-Gly-Xaa-Gly (SEQ ID NO: 95) 
3 to 25 residues 



5. Detectioii of dominant CDR & framework regions in VH and/or VL repertoire 

The database was constracted using SQL Server Database software (Microsoft). This 
allowed the database to be queried using SQL and allowed CDRl, CDR2 and CDR3 
sequences to be extracted firom any range of database VH and/or VL sequences in FASTA 
format. Extracted CDRs were then subject to multiple alignment using CLUSTAL X in 
such a way so that identical or very similar CDRs are grouped together in blocks 
(Thompson, J.D., Higgins, D.G. and Gibson, TJ. (1994) CLUSTAL W: improving the 
sensitivity of progressive multiple sequence alignment through sequence weighting, 
positions-specific gap penalties and weight matrix choice. Nucleic Acids Research, 
22:4673-4680). From this the firequency of recurring CDRs was determined and CDR- 
dominance in the immunoglobulm repertoire was established. Alternatively, a graphical 
interface was employed using Probiosys (proBionic EMEA, Amsterdam). ProBiosys 
prepares dendrograms from CLUSTAL .phy files and aUows a rapid visual interpretation 
of the relationships among large numbers of aligned sequences. The whole process was 
performed for CDRs from the same patient sampled at different points during an ilhiess, 
for CDRs from patients with different infections, or from sex-matched, age-matched or 



wo 2004/094474 



PCT/GB2004/001619 



-33- 

race-matched patient groups, depending on the search and the additional infonnation added 
to the database entries. 



6. Construction & production of therapeutic recombinant antibodies from dominant 
antibody sequences 

Once dominant antibody sequences have been recognised in a given repertoire, the 
information can be used to infer the presence of CDRs and frameworks that confer 
unmunity. Selected VH and/or VL sequences can be identified and their gene sequences 
resynthesised usmg synthetic oligonucleotides. This is inq)ortant as it also allows the 
human-derived gene sequence to be codon-optunised for E. coli in order to improve 
protein expression. Dominant VH and/or VL sequences can be spliced together using a 
spacer (Unker) sequence (Figure 2). This gene cassette, termed a scFv, can be resynthesised 
to include terminal Ndel and Notl restriction sites for cloning into expression vector 
pET29b (Novagen). ScFv DNA can be cut with Ndel (cuts in VH) or Notl (cuts m VL) 
using the following reaction: 40 jil DNA (4 jig), 10 fil of Restriction enzyme buffer D 
(Fromega), 47 jil water and 2 fil Ndel and Notl enzyme (10 u jil"^; Promega) with 
digestion for 4 h at 37 °C. DNA can be then fractionated on 0.7% agarose TAB gels and 
the digested DNA excised from the gel and purified from the agarose slice using the 
Geneclean n kit (Bio 101) exactly accordmg to the manufecturers instruction. pET29b 
vector DNA can be cut with Ndel and NoA as described above. 1 ng vector can be nuxed 
with 1 ng restricted VL DNA resuspended in 8.5 ^1 water and ligated by addition of 1 fil 
10 X ligation buffer (Boehringer Mannheim) and 0.5 \i\ DNA ligase (3 u ; Boehringer 
Mannheim), followed by hgation overnight at 14 "C. 

The entire ligation can be transformed mto E. coli TG2 cells (Stratagene, UK) by 
electroporation. Transformants can be plated out onto agar plates containing Tetracycline 
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(25 \ig ml ' - selective for recA ::TnlO on the chromosome of TG2) and Kanamycin (50 
ml-' - selective for pET29). Recombinant plasmid DNA prepared from individual 
clones can be checked for scFv sequence by digestion with Ndel and NotI as described 
above. 

Recombinant scFvs can be over-e3q)ressed in E. coli strain JM109(DE3) and purified thus 
enabling biochemical and biological characterisation. Recombinants can be spread on LB 
plates supplemented with 50 ng ml ' kanamycin and a single colony can be used to 
inoculate a 10 microlitre starter culture of LB broth with 50 fig mr' kanamycin. For this, 
a 1 L expression culture can be prepared in the presence of kanamycin for 4-5 h at 37 °C 
with shaking at 300 ipm. At OD«,o =1, induction can be perfonned using IPTG and the 
cells can be grown witii vigorous aeration for a further 3-5 h. Cells can then be harvested 
by centrifugation (4000 x g, 15 minutes, 4 °C). The cell pellet can be resuspended in 10 
ml fresh (4 **C) lysis buffer (below) and stored at -20 ° C overnight prior to cell breakage 
using the 25 ml X-Press system (AB Biox) 

LYSIS BtJFFRR 

50mMTrisHClpH8.0 
1 mM BDTA 
100 mM KCl 

0.1 mM AEBSF (amino ethyl benzene sulphonic acid) 

The lysate can be centrifoged in an Oakridge tube (24 300 x g, 4 "C, 30 minutes). The 
pellet can be resuspended in 20 ml of ice cold lysis buffer on ice using a Silverson lab 
blender. The lysate can be spUt into 8 x Oakridge tubes and each aliquot can be diluted to 
30 ml (total 120 ml) in ice-cold Lysis Buffer. Inclusion bodies can be pelleted (24 300 x 
g, 4 °C, 30 minutes) and the supernatant discarded. Each of the 8 pellets can be 
resuspended in 30 ml of lysis buffer, and the inclusion bodies can be harvested by 
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centrifiigation again (24 300 x g, 4 °C, 30 minutes). This wash/centrifugation step can be 
perfonned five times in total to clean fbs inclusion body firaction. Each pellet can be 
resuspended in 15 ml of ice-cold water and the inclusion bodies can be stored at -20 °C. 

For refolding, 200 ml solution of 2% (w/v) N-lauryl sarcosine (NLS) in 50 mM Tris HCl 
pH 9.0 can be prepared as follows. 4 g NLS can be solubilised in 100 ml water with 
stirring. 10 ml of IM Tris pH 9.0 stock solution can be added and made up to 195 ml with 
water. 5 ml of inclusion body shmy can be added and stirred vigorously for 30 minutes at 
room temperature. 

CUCI2 can be added to a concentration of 100 jiM. This serves as a catalyst for oxidation. 
The refolding reaction can be transfmed to 4 °C and stirred vigorously for 2 days to 
promote aeration. The refolding reaction cari be vacuum filtered through a 0.44 fxM 
vacucup bottle top filter unit (90 mm diameter, Gelhnan Sciences) and the filtrate 
transferred to a Pellicon Labscale TFF system fitted with PLGCIO membrane unit 
(Millipore). The reaction can be concentrated to 25 ml using tangential flow, discarding 
the permeate (the scFv is localised to the retentate). The solution can be diafiltered against 
40 X turn-over volumes (1 L) of 10 mM ammonium acetate (AAT) pH 9.0. Finally, the 
volume of the antibody can be diluted to 50 ml using 10 mM AAT pH 9.0. The buffer 
exchanged antibody can be stored for 2 hours at 4 "C. The typical protein content was 1 - 
2 mg ml"' with a yield of up to 50 mg per litre. 

For scFv purification a 10 ml bed volume of Ni NTA superflow agarose Qiagen can be 
prepared according to the manufacturers instructions using a 10 ml glass colimm (Sigma) 
without flow adaptors. The column can be equilibrated with 50 ml Buffer B (6M urea, 0.1 
M NaH2P04, 10 mM Tris HCl pH 8.0). Refolded scFv (as described above)can be diluted 
1/5 in Buffer B and applied directiy to the column at ml min'* . 50 ml buffer B can be 
applied to tiie column (flow rate 5 ml min'*) followed by 50 ml Buffer C (same 
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composition as buffer B but pH 6.3). The purified scFv can be eluted ftom the column by 
slowly applying Buffer B supplemented with 250 mM high grade imidazole until the 
< 0.05. 

Results 

The variable heavy chain (VH) and variable light chains of the antibody genes fiom 
peripheral blood lyn:q)hocytes (PBLs) isolated fix)m C. difficile infected patients have been 
TA cloned and sequenced. In each case the third complem^tanty determining region (i.e. 
CDR-H3 or CDR-L3) was identified. This region is both the most variable region of the 
VH chain and the area identified as being most important in determining antig^ binding, 
for this reason this area is used as the signature for each VH chain. 

Clinical Information 

Patient DOl - Variable heavy chains (VH's) and variable light chains (VL's) were 
sequenced firom PBLs taken ten days after the onset of CDAD. Lab results confirmed C. 
difficile toxin was present 

Patient D02 - VH's and VL's were sequenced fi-om PBLs taken ten days after the onset of 
CDAD. Lab results confirmed C. difficile toxin was present. In the seven subsequent 
weeks after sample was taken, the patient did not develop any fiirther C. difficile 
infections, suggesting that the antibodies produced in response to the infection were 
protective. 

Patient D03 - VH's and VL's were sequenced firom PBLs taken ten days after the onset of 
CDAD. Lab results confirmed C. difficile toxin was present In the three subsequent weeks 
after sample was taken, the patient did not develop any fijrther C difficile mfections, 
suggesting that the antibodies produced in response to the infection were protective. 
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Sequencing results 

Patient DOl showed a high degree of focusing with both VH's and VL's. Three different 
CDIG sequences make-up over 80 % of the sequenced VH's. VH's with the CDR3 
sequence of SEQ ID NO: 27 account for 57.8 % of sequenced VH's from this patient. 
5 VH's with the CDR3 sequence of SEQ ID NO: 28 account for 1 3.5 % of all VH's, and 
VH's the CDR3 sequence of SEQ ID NO: 29 account for 10 % of all sequenced VH's in 
Has patient. Three different CDR3 sequences also make up the majority of sequenced 
VL's, these are the CDR3 sequences of SEQ ID NOs: 32, 34, and 33 (occurring at 
frequencies of 32.7 %, 15.3%, and 6.6%, respectively). 

10 

Similarity searches were performed using these common VH and/or VL sequences against 
other VH and/or VL sequences contained within a database (FABTEC database), which 
contains VH and/or VL regions of PBLs isolated from patients with methicillin-resistant 
Staphylococcus aureus (MRSA), Candida albicans, Pseudomonas aeruginosa (the 

IS causative agent of cystic fibrosis), Streptococcus oralis and vancomycin-resistant 
Enterococci (VRE) infections and to identify antibody sequences which are associated with 
resistance to the infections and which can therefore be used to effect therapy against those 
mfections. These sequences have formed tiie basis of WO 01/76627. For all of these, the 
isolated VH and/or VL sequences demonstrate skewing of the antibody repertoire over the 

20 course of infection, thus revealing the identity of matured, immimity-conferring VH and/or 
VL sequences. This has typically been done using 5,000 to 15,000 sequences for each VH 
and/or VL. VH and/or VL repertoires from healthy individuals can also be used for 
comparison purposes. 

25 Using the BLOSUM50 matrix it was possible to pick out groups of antibodies with 
CDR3's that had greater than 70% similarity. The CDR3 sequences of SEQ ID NOs: 27 
and 28 (Table 2 and 3, respectively), showed sequence similarity of greater than 70% only 
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with other VHs from C. difficile infected patients VH's sequenced from individuals with 
other infections (C. albicans, MRS A and VRE) showed no similar CDR3 sequences. 

In the following Tables, libraries starting D are derived from C difficile infected patients, 
libraries starting M are derived from MRSA infected patients, libraries starting V are 
derived from VRE infected patients; and libraries starting C are derived from C. albicans 
infected patients. 



Table 2 



CDR3SEQroNO: 


Similarity 


Library 


No. of clones 


27 


100.0 


DOl 


184 


27 


100.0 


D03 


5 


35 


100.0 


DOl 


1 


36 


100.0 


DOl 


3 


37 


100.0 


DOl 


1 


37 


100.0 


D03 


1 


38 


93.75 


DOl 


1 



Tables 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


28 


100.0 


DOl 


43 


28 


100.0 


D03 


1 


39 


100.0 


DOl 


1 


40 


88.88 


DOl 


1 



VH's with the CDR3 sequence of SEQ ID NO: 29 showed similarity to VH's from 
other C. difficile infected patients and MRSA infected patients (Table 4). 
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Table 4 



CDR3 SEQIDNO: 


Similarity 


Library 


No. of clones 


41 


100.0 


DOl 


1 


29 


100.0 


DOl 


32 


29 


100.0 


MOl 


1 


29 


100.0 


M02 


5 


29 


100.0 


M03 


2 


29 


100.0 


M04 


8 


42 


100.0 


M03 




43 


100.0 


DOl 




44 


93.75 


DOl 




45 


93.75 


DOl 




46 


75.0 


M04 




47 


70.58 


M05 





The CDR3 sequences from the VL's of patient DOl have also had a similarity search 
performed on them. VL's with the CDR3 sequences of SEQ ID NOs: 32 and 33 (Tables 
5 and 6, respectively) show a high degree of homology with CDR3 sequences from C. 
difficile infected patients but no homology to CDR3 sequences derived from patients with 
different clinical infections. VL's with the CDR3 sequence of SEQ ID NO: 34 showed 
similarity to VL's isolated from C. difficile, MRSA and VRE infected patients (Table 7). 
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Table 5 



CDR3 SEQIDNO: 


Similarity 


Library 


No. of clones 


32 


100.0 


D51 


64 


48 


90.9 


D52 


2 


49 


90.9 


D52 


1 


50 


90.9 


D52 


1 


51 


90.0 


D51 


1 


52 


90.0 


D52 




53 


81.8 


D52 




54 


81.8 


D52 




55 


81.8 


D52 




56 


81.8 


D52 




57 


80.0 


D52 




58 


72.7 


D52 




Table 6 


CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


33 


100.0 


D51 


13 


59 


90.9 


D52 


1 
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Table 6 continued 



CDR3 SEQIDNO; 


Similarity 


Library 


No. of clones 


60 


83.3 


D52 


1 


61 


81.8 


D52 


4 


62 


81.8 


D52 


1 


63 


81.8 


D52 




64 


81.8 


D52 




65 


81.8 


D51 




66 


81.8 


D52 




67 


81.8 


D52 




68 


72.7 


D52 




69 


72.7 


D52 





Table 7 



CDR3 SEQIDNO: 


Similarity 


Library 


No. of clones 


34 


100.0 


D51 


30 


34 


100.0 


M56 


2 


34 


100.0 


V54 


2 


70 


88.9 


D51 


1 


71 


88.9 


D51 


1 


72 


88.9 


V54 


2 


73 


77.9 


M55 


1 


74 


77.9 


D51 


1 


75 


77.9 


M55 


1 


76 


77.9 


M55 1 


1 
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Patient D02 also showed a high degree of focusing of the VH repertoire, although focusing 
is less obvious in the VL repertoire. Two types predominate in the VH library of this 
patient. VH's with the CDR3 sequence of SEQ ID NO: 30 accounted for 45.6% of 
sequenced clones and those with the CDR3 sequence of SEQ ID NO: 31 accounted for 
8.8% of the library. In the VL library the three most frequently isolated clones with the 
CDR3 sequences of SEQ ID NOs: 61, 96, and 97 were found at 4%, 4% and 3%, 
respectively. 

Similarity searches in the FABTEC database showed the two common VH CDR3 
sequences to be specific to C. difficile patients (Tables 8 and 9). VH's with the CDR3 
sequence of SEQ ID NO: 30 match with other antibodies in that library and antibodies 
from library D03 (Table 8). VH's with the CDR3 sequence of SEQ ID NO: 3 1 match with 
antibodies from library D03 (Table 9). 



Table 8 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


77 


100.0 


D02 




78 


100.0 


D02 




30 


100.0 


D02 


52 


30 


100.0 


D03 




79 


100.0 


D02 




79 


100.0 


D03 




80 


94.4 


D02 




81 


94.4 


D02 




82 


94.4 


D02 




83 


88.8 


D02 




84 


77.7 


D03 
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Table 9 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


31 


100.0 


D02 


10 


31 


100.0 


DOB 


4 


85 


90.0 


D03 


1 


86 


90.0 


D03 


1 



Similarity searches using some of the VL's showed a clustering of sequences. 15% of VL's 
from patient D02 (Ubrary D52) showed high degree of similarity to the CDR3 sequence 
of SEQ ID NO: 48, VL's from patient DOl (Ubrary D51) also showed a high degree of 
homology to this sequence (Table 10). In a similar way a cluster around the sequence of 
SEQ ID NO: 61 accounts for 13% of patient D02 sequenced VL's, again clones similar to 
this one are also found in patient DOl (Table 11). Neither of these clones show any 
significant similarity to VL CDR3 sequences isolated from patients infected with other 
pathogens. It should also be note that using the BLOSUM50 matrix these two clones show 
greater than 60% similarity to each other. 



Table 10 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


48 


100.0 


D52 


2 


49 


100.0 


D52 


1 


53 


100.0 


D52 


1 


32 


90.9 


D51 


64 


87 


81.8 


D52 


2 


51 


81.8 


D51 


1 


88 


81.8 


D52 


1 
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Table 10 continued 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


57 


72.7 


D52 


1 


89 


72.7 


D52 


1 


50 


72.7 


D52 


1 


54 


72.7 


D52 


1 


55 


72.7 1 D52 


2 


Table 11 


CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


61 


100.0 


D52 


4 


65 


100.0 


D51 




59 


81.8 


D52 




33 


81.8 


D51 


13 


62 


81.8 


D52 




63 


81.8 


D52 




60 


75.0 


D52 




64 


72.7 


D52 




66 


72.7 


D52 




68 


72.7 


D52 




67 


72.7 


D52 





Patient D03 has only preliminary results for VH's but shows many similar sequences to 
both DOl and D02 so are worth reporting. Only data from 49 VH sequences have been 
produced. 10% of the clones have the CDR3 sequence of SEQ ID NO: 27, this is the 
sequence of the most common VH in patient DOl. 8% of VH's have the CDR3 sequence 
of SEQ ID NO: 31 - this is the second most common VH CDR3 sequence in patient D02. 
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Cross-Library comparison (Table 12) of tlie CDR3 sequences of VH's has shown 8 CDR3 
sequences which appear in more than one of the C. difficile infected patients. The CDR3 
sequences of SEQ ID NOs: 27, 28, and 37 appear in patients DOl and D03 and the 
sequences of SEQ ID NOs: 31, 90, 91, 79, and 30 all appear in patients D02 and D03. 
5 So far patients DOl and D02 show no common CDR3 sequences. VL's have only been 
sequenced from patients DOl and D02, these patients share no identical CDR3 sequences. 



Table 12 



VHCDR3SEQIDNO: 


Patient 


27 


DOl and D03 


28 


DOl and D03 


37 


DOl and D03 


31 


D02andD03 


90 


D02 and D03 


91 


D02andD03 


79 


D02andD03 


30 


D02andD03 



Summary 

20 The presence of a large number of a particular VH CDR3 sequence in a patient with a 
C. difficile infection indicates that VH as part of an scFv may be protective against the 
organism. CDR3 sequences that are shared by more than one patient indicate specific 
C. difficile sequences. It should also be noted that sequences with a high degree of 
homology often due to somatic mutation within the patient may also represent 

25 important sequences with similar but subtly different properties. Analysis of the CDR3 
sequences of the VH from our individuals has lead to the identification of 17 
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potentially protective VH's. These VH CDR3 sequences correspond to SEQ ID NOs: 
30, 79, 28, 91, 41, 39, 38, 27, 36, 36, 40, 37, 90, 29, 44, 45, and 43. 

A similar analysis of the VL has identified 44 different VL CDR3 sequences. The larger 
5 niimber is due to the increased rate of somatic mutation in antibody light chains leading 
to VL's with slightly different sequences. These VL C3DR3 sequences correspond to SEQ 
ID NOs: 59, 60, 61, 33, 62, 63, 64, 65, 66, 77, 84, 78, 30, 79, 80, 81, 82, 87, 57, 74, 85, 31, 
86, 82, 51. 48, 88, 49, 53, 32, 52, 70, 71, 34, 89, 68, 67, 50, 54, 55, 56, 59, 58, 96, and 97. 



